CDS

Accession Number TCMCG075C20430
gbkey CDS
Protein Id XP_007025756.1
Location complement(join(24293589..24293707,24294530..24294603,24295037..24295104,24295457..24295549,24296164..24296247,24296918..24296973,24297081..24297191,24297321..24297501,24298260..24298388))
Gene LOC18596932
GeneID 18596932
Organism Theobroma cacao
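
The Location field above uses the GenBank feature-location syntax: `join(...)` lists the exon segments as 1-based inclusive coordinates, and `complement(...)` indicates that the feature lies on the reverse strand. As a minimal sketch (the helper names `parse_segments` and `cds_length` are illustrative, not part of any library), the segment lengths can be summed and compared with the protein length reported in this record:

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(24293589..24293707,24294530..24294603,"
    "24295037..24295104,24295457..24295549,24296164..24296247,"
    "24296918..24296973,24297081..24297191,24297321..24297501,"
    "24298260..24298388))"
)


def parse_segments(location):
    """Return (start, end) pairs, 1-based inclusive, in the order written."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Total number of nucleotides covered by all segments."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
on_reverse_strand = LOCATION.startswith("complement(")
total_nt = cds_length(LOCATION)

# Codons encoded, minus one for the stop codon, gives the residue count.
residues = total_nt // 3 - 1
print(len(segments), total_nt, residues, on_reverse_strand)
```

Under these assumptions the nine segments total 915 nt, that is 305 codons including the stop codon, which agrees with the 304 aa length listed under Protein.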

Protein

Length 304aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007025694.2
Definition PREDICTED: endonuclease 4 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description endonuclease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0000014
GO:0003674
GO:0003824
GO:0004518
GO:0004519
GO:0004520
GO:0004536
GO:0005575
GO:0005576
GO:0006139
GO:0006259
GO:0006725
GO:0006807
GO:0008150
GO:0008152
GO:0009987
GO:0016787
GO:0016788
GO:0034641
GO:0043170
GO:0043765
GO:0044237
GO:0044238
GO:0044260
GO:0046483
GO:0071704
GO:0090304
GO:0090305
GO:0140097
GO:1901360

Sequence

CDS:  
ATGGGTTTTAGAGGAATGGGTTCGCATGAGCTACTTTGGATTGGAAGGGTGCTTGTTCTTATGTTATTAGTTCATGGGGTAATTGGTTGGGGAAAGGAGGGTCATTATGCTGTTTGCAAGATAGCTGAGGGATATCTTACTGAAGATGCTTTGGCTACAGTAAAAGAACTGCTCCCTGATTCAGCTAAAGGTGAGCTTGCATCTGTATGCTCCTGGCCTGATGATATAAAGTGGTACTACAACTGGCACTGGACTAGTCCCTTACACTATGTTGACACCCCAGATTTAAAGTGTAACTATGAATACTGCCGTGACTGCCATGATCTTGCTGGACATAAAAATATTTGCGTAACTGGAGCAATTTTCAACTATACAAGCCAACTCTTTTCAGCATATCAGGACTATAAGCCCAAGTTGAAATACAATTTGACAGAGGCACTTATGTTCTTAGCTCATTTTATGGGGGATGTCCATCAGCCATTACATGTTGGCTTCACAGGAGATTTAGGTGGAAATACAATCACAGTCCGTTGGTATCGTAGGAAGACAAATCTACACCATGTCTGGGATACCATGATTATTGATTCTGCAGTGAAGACATTCTATGGATCGGATCTTGCAATAATGATTCAAGCCATCCAGAGGAATATTACAGATGCTTGGTCCAATGATATACCATCATGGGAATATTGTGGATATAATCATGCAGTTTGTCCTAACCTGTATGCTTCTGAAAGTGTTGGGTTGGCATGTAAGTTTGCATACAGGAATGCCACGCCTGGAAGCACCTTAGAAGATGATTATTTCCTCTCTCGGTTGCCTATTGTGGAGAAGAGGCTTGCTCAAGGTGGGATTCGCCTTGCTGCAGTGCTCAACCGAATATTTACTTCTGAAGTGAAAATTGCTCGAGCATGA
Protein:  
MGFRGMGSHELLWIGRVLVLMLLVHGVIGWGKEGHYAVCKIAEGYLTEDALATVKELLPDSAKGELASVCSWPDDIKWYYNWHWTSPLHYVDTPDLKCNYEYCRDCHDLAGHKNICVTGAIFNYTSQLFSAYQDYKPKLKYNLTEALMFLAHFMGDVHQPLHVGFTGDLGGNTITVRWYRRKTNLHHVWDTMIIDSAVKTFYGSDLAIMIQAIQRNITDAWSNDIPSWEYCGYNHAVCPNLYASESVGLACKFAYRNATPGSTLEDDYFLSRLPIVEKRLAQGGIRLAAVLNRIFTSEVKIARA